Rio Grande do Norte
- Europe > Austria > Vienna (0.14)
- North America > United States > California > Los Angeles County > Long Beach (0.14)
- North America > United States > Louisiana > Orleans Parish > New Orleans (0.04)
- (14 more...)
- Information Technology > Artificial Intelligence > Representation & Reasoning > Logic & Formal Reasoning (1.00)
- Information Technology > Artificial Intelligence > Natural Language (0.93)
- Information Technology > Security & Privacy (0.93)
- Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (0.68)
Retrieval and Argumentation Enhanced Multi-Agent LLMs for Judgmental Forecasting
Gorur, Deniz, Rago, Antonio, Toni, Francesca
Judgmental forecasting is the task of making predictions about future events based on human judgment. This task can be seen as a form of claim verification, where the claim corresponds to a future event and the task is to assess the plausibility of that event. In this paper, we propose a novel multi-agent framework for claim verification, whereby different agents may disagree on claim veracity and bring specific evidence for and against the claims, represented as quantitative bipolar argumentation frameworks (QBAFs). We then instantiate the framework for supporting claim verification, with a variety of agents realised with Large Language Models (LLMs): (1) ArgLLM agents, an existing approach for claim verification that generates and evaluates QBAFs; (2) RbAM agents, whereby LLM-empowered Relation-based Argument Mining (RbAM) from external sources is used to generate QBAFs; (3) RAG-ArgLLM agents, extending ArgLLM agents with a form of Retrieval-Augmented Generation (RAG) of arguments from external sources. Finally, we conduct experiments with two standard judgmental forecasting datasets, with instances of our framework with two or three agents, empowered by six different base LLMs. We observe that combining evidence from agents can improve forecasting accuracy, especially in the case of three agents, while providing an explainable combination of evidence for claim verification.
- Asia > Middle East > UAE > Abu Dhabi Emirate > Abu Dhabi (0.14)
- Europe > United Kingdom > England > Greater London > London (0.04)
- Europe > Italy > Emilia-Romagna > Metropolitan City of Bologna > Bologna (0.04)
- (11 more...)
- Europe > Austria > Vienna (0.14)
- North America > United States > California > Los Angeles County > Long Beach (0.14)
- North America > United States > Louisiana > Orleans Parish > New Orleans (0.04)
- (14 more...)
- Information Technology > Artificial Intelligence > Representation & Reasoning > Logic & Formal Reasoning (1.00)
- Information Technology > Artificial Intelligence > Natural Language (0.93)
- Information Technology > Security & Privacy (0.93)
- Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (0.68)
Sea level rise could plunge 100 MILLION buildings underwater, warn scientists - so, is your home at risk?
AOC hit by shockingly crude sex insult by White House after she mocked'TINY' Stephen Miller Biden ordered CIA cover-up of his'corrupt' business ties to Ukraine, astonishing secret files show NYC girls aged 12 and 13 meet tragic end after going subway surfing across Williamsburg Bridge at 3.10am ERIC TRUMP: The darkest day in my dad's marriage to Melania... before the ugly truth was exposed More girls are starting their periods younger than ever before - scientists think they've finally found what's causing it Taylor Swift reveals truth behind raunchy song about Travis Kelce's manhood Meghan is accused of'giggling as model stumbles on the catwalk': More Paris Fashion Week disasters emerge, including awkward moment with Kristin Scott Thomas The TRUTH to the doting mother who slaughtered her children and husband told by those she'd been quietly tormenting for years The troubled background of delivery man stabbed by Mark Sanchez... as he launches million-dollar lawsuit and sparks civil war at Fox Revealed: Which slimming jab REALLY works best. The doctors' ultimate expert guide on which to pick, how to save money, beat every side effect... and what you need to know about the'golden dose' I haven't heard that name in so long' Ominous warning for humanity as birds suddenly adopt'unsettling' behavior And a humiliating lifeline: Backroom secrets of Taylor Swift and Blake Lively... after hit new song Bottled water contains dangerous levels of microplastics that lodge in vital organs and raise cancer risk', scientists warn Sea level rise could plunge 100 MILLION buildings underwater, warn scientists - so, is your home at risk? Rising sea levels could plunge more than 100 million buildings underwater by 2100, scientists have warned. The experts in Canada estimated how many buildings in Africa, Southeast Asia and Central and South America would be flooded by different sea level changes. Their assessment found that sea level rises of just 1.6 feet (0.5 metres) would flood three million buildings in the global south alone.
- Asia > Southeast Asia (0.24)
- Europe > Ukraine (0.24)
- North America > Canada > Quebec > Montreal (0.14)
- (30 more...)
- Personal (0.46)
- Research Report (0.46)
- Media > Television (1.00)
- Media > Music (1.00)
- Media > Film (1.00)
- (4 more...)
A Review on Single-Problem Multi-Attempt Heuristic Optimization
Echevarrieta, Judith, Arza, Etor, Pérez, Aritz, Ceberio, Josu
In certain real-world optimization scenarios, practitioners are not interested in solving multiple problems but rather in finding the best solution to a single, specific problem. When the computational budget is large relative to the cost of evaluating a candidate solution, multiple heuristic alternatives can be tried to solve the same given problem, each possibly with a different algorithm, parameter configuration, initialization, or stopping criterion. The sequential selection of which alternative to try next is crucial for efficiently identifying the one that provides the best possible solution across multiple attempts. Despite the relevance of this problem in practice, it has not yet been the exclusive focus of any existing review. Several sequential alternative selection strategies have been proposed in different research topics, but they have not been comprehensively and systematically unified under a common perspective. This work presents a focused review of single-problem multi-attempt heuristic optimization. It brings together suitable strategies to this problem that have been studied separately through algorithm selection, parameter tuning, multi-start and resource allocation. These strategies are explained using a unified terminology within a common framework, which supports the development of a taxonomy for systematically organizing and classifying them.
- North America > United States > Massachusetts > Middlesex County > Cambridge (0.14)
- North America > Canada > Ontario > Toronto (0.14)
- North America > United States > Massachusetts > Suffolk County > Boston (0.04)
- (21 more...)
- Overview (1.00)
- Research Report (0.82)
- Health & Medicine > Therapeutic Area > Neurology (0.93)
- Transportation (0.68)
- Energy (0.67)
- Information Technology > Artificial Intelligence > Representation & Reasoning > Search (1.00)
- Information Technology > Artificial Intelligence > Representation & Reasoning > Optimization (1.00)
- Information Technology > Artificial Intelligence > Machine Learning > Statistical Learning (1.00)
- Information Technology > Artificial Intelligence > Machine Learning > Evolutionary Systems (1.00)
Learning-Based Testing for Deep Learning: Enhancing Model Robustness with Adversarial Input Prioritization
Rahman, Sheikh Md Mushfiqur, Eisty, Nasir
Context: Deep Neural Networks (DNNs) are increasingly deployed in critical applications, where resilience against adversarial inputs is paramount. However, whether coverage-based or confidence-based, existing test prioritization methods often fail to efficiently identify the most fault-revealing inputs, limiting their practical effectiveness. Aims: This project aims to enhance fault detection and model robustness in DNNs by integrating Learning-Based Testing (LBT) with hypothesis and mutation testing to efficiently prioritize adversarial test cases. Methods: Our method selects a subset of adversarial inputs with a high likelihood of exposing model faults, without relying on architecture-specific characteristics or formal verification, making it adaptable across diverse DNNs. Results: Our results demonstrate that the proposed LBT method consistently surpasses baseline approaches in prioritizing fault-revealing inputs and accelerating fault detection. By efficiently organizing test permutations, it uncovers all potential faults significantly faster across various datasets, model architectures, and adversarial attack techniques. Conclusion: Beyond improving fault detection, our method preserves input diversity and provides effective guidance for model retraining, further enhancing robustness. These advantages establish our approach as a powerful and practical solution for adversarial test prioritization in real-world DNN applications.
- North America > United States > Tennessee > Knox County > Knoxville (0.14)
- South America > Brazil > Rio Grande do Norte > Natal (0.04)
- North America > United States > New York > New York County > New York City (0.04)
- Asia > Thailand > Chiang Mai > Chiang Mai (0.04)
Even More Kawaii than Real-Person-Driven VTubers? Understanding How Viewers Perceive AI-Driven VTubers
Wei, Yiluo, He, Yupeng, Tyson, Gareth
VTubers, digital personas represented by animated avatars, have gained massive popularity. Traditionally, VTubers are operated and voiced by human controllers known as Nakanohito. The reliance on Nakanohito, however, poses risks due to potential personal controversies and operational disruptions. The emergence of AI-driven VTubers offers a new model free from these human constraints. While AI-driven VTubers present benefits such as continuous operation and reduced scandal risk, they also raise questions about authenticity and audience engagement. Therefore, to gain deeper insights, we conduct a case study, investigating viewer perceptions of Neuro-sama, the most popular AI-driven VTuber with 845k followers on Twitch and 753k followers on YouTube. We analyze 108k Reddit posts and 136k YouTube comments, aiming to better understand viewer motivations, how AI constructs the virtual persona, and perceptions of the AI as Nakanohito. Our findings enhance the understanding of AI-driven VTubers and their impact on digital streaming culture.
- North America > United States > New York > New York County > New York City (0.05)
- Asia > China > Hong Kong (0.04)
- Asia > China > Guangdong Province > Guangzhou (0.04)
- (11 more...)
- Information Technology (0.92)
- Media (0.70)
- Leisure & Entertainment > Games (0.46)
- Health & Medicine > Therapeutic Area > Psychiatry/Psychology (0.45)
- Information Technology > Communications > Social Media (1.00)
- Information Technology > Artificial Intelligence > Natural Language > Chatbot (0.34)
Centralized vs. Federated Learning for Educational Data Mining: A Comparative Study on Student Performance Prediction with SAEB Microdata
The application of data mining and artificial intelligence in education offers unprecedented potential for personalizing learning and early identification of at-risk students. However, the practical use of these techniques faces a significant barrier in privacy legislation, such as Brazil's General Data Protection Law (LGPD), which restricts the centralization of sensitive student data. To resolve this challenge, privacy-preserving computational approaches are required. The present study evaluates the feasibility and effectiveness of Federated Learning, specifically the FedProx algorithm, to predict student performance using microdata from the Brazilian Basic Education Assessment System (SAEB). A Deep Neural Network (DNN) model was trained in a federated manner, simulating a scenario with 50 schools, and its performance was rigorously benchmarked against a centralized eXtreme Gradient Boosting (XGBoost) model. The analysis, conducted on a universe of over two million student records, revealed that the centralized model achieved an accuracy of 63.96%. Remarkably, the federated model reached a peak accuracy of 61.23%, demonstrating a marginal performance loss in exchange for a robust privacy guarantee. The results indicate that Federated Learning is a viable and effective solution for building collaborative predictive models in the Brazilian educational context, in alignment with the requirements of the LGPD.
- South America > Brazil > Rio Grande do Norte (0.04)
- Asia > Middle East > Saudi Arabia (0.04)
- Research Report > New Finding (1.00)
- Workflow (0.93)
- Information Technology > Security & Privacy (1.00)
- Education > Assessment & Standards > Student Performance (1.00)
- Education > Educational Setting > Higher Education (0.93)
A Robust Pipeline for Differentially Private Federated Learning on Imbalanced Clinical Data using SMOTETomek and FedProx
Federated Learning (FL) presents a groundbreaking approach for collaborative health research, allowing model training on decentralized data while safeguarding patient privacy. FL offers formal security guarantees when combined with Differential Privacy (DP). The integration of these technologies, however, introduces a significant trade-off between privacy and clinical utility, a challenge further complicated by the severe class imbalance often present in medical datasets. The research presented herein addresses these interconnected issues through a systematic, multi-stage analysis. An FL framework was implemented for cardiovascular risk prediction, where initial experiments showed that standard methods struggled with imbalanced data, resulting in a recall of zero. To overcome such a limitation, we first integrated the hybrid Synthetic Minority Over-sampling Technique with Tomek Links (SMOTETomek) at the client level, successfully developing a clinically useful model. Subsequently, the framework was optimized for non-IID data using a tuned FedProx algorithm. Our final results reveal a clear, non-linear trade-off between the privacy budget (epsilon) and model recall, with the optimized FedProx consistently out-performing standard FedAvg. An optimal operational region was identified on the privacy-utility frontier, where strong privacy guarantees (with epsilon 9.0) can be achieved while maintaining high clinical utility (recall greater than 77%). Ultimately, our study provides a practical methodological blueprint for creating effective, secure, and accurate diagnostic tools that can be applied to real-world, heterogeneous healthcare data.
- North America > United States > New York > New York County > New York City (0.05)
- South America > Brazil > Rio Grande do Norte > Natal (0.04)
- Research Report > New Finding (1.00)
- Research Report > Experimental Study (1.00)
AI-driven Orchestration at Scale: Estimating Service Metrics on National-Wide Testbeds
Moreira, Rodrigo, Pasquini, Rafael, Martins, Joberto S. B., Carvalho, Tereza C., Silva, Flávio de Oliveira
Network Slicing (NS) realization requires AI-native orchestration architectures to e fficiently and intelligently handle heterogeneous user requirements. To achieve this, network slicing is evolving towards a more user-centric digital transformation, focusing on architectures that incorporate native intelligence to enable self-managed connectivity in an integrated and isolated manner. However, these initiatives face the challenge of validating their results in production environments, particularly those utilizing ML-enabled orchestration, as they are often tested in local networks or laboratory simulations. This paper proposes a large-scale validation method using a network slicing prediction model to forecast latency using Deep Neural Networks (DNNs) and basic ML algorithms embedded within an NS architecture evaluated in real large-scale production testbeds. It measures and compares the performance of di fferent DNNs and ML algorithms, considering a distributed database application deployed as a network slice over two large-scale production testbeds. The investigation highlights how AI-based prediction models can enhance network slicing orchestration architectures and presents a seamless, production-ready validation method as an alternative to fully controlled simulations or laboratory setups. Keywords: Network Slicing, Deep Neural Networks, Machine Learning, Service-Level Agreement, Distributed Database1. Introduction Modern applications require challenging behaviors from physical networks to satisfy stringent requirements such as ultra-reliability, low latency, and high throughput [1]. In addition to these quantifiable network requirements, it is necessary to incorporate seamless, intelligent, and pervasive network capabilities to satisfy user demands [2, 3]. Although network management, control planes, and data planes have evolved to address this issue, challenges remain and require further large-scale evaluation. Many approaches, technologies, and methods have been developed to build user-oriented network architectures that provide connectivity in an isolated and personalized manner [4]. One key technological enabler of this vision is network slicing, which establishes network connectivity on top of physical infrastructure while ensuring isolation, end-to-end connectivity, and application-driven requirements, with dedicated control and data planes [5]. With this service-tailoring capability, Machine Learning (ML) e ffectively addresses various management and orchestration challenges, thereby enabling intelligent and real-time insights for service provider managers. AI techniques, such as reinforcement learning, supervised learning, and unsupervised learning, have been e ff ectively integrated with network orchestrators to mitigate cybersecurity threats, enable intelligent resource allocation, and ensure Service-Level Agreement (SLA) assurance for network slicing [7, 8, 9, 10].
- North America > United States > Missouri > Jackson County > Kansas City (0.14)
- South America > Brazil > São Paulo (0.04)
- South America > Brazil > Santa Catarina (0.04)
- (9 more...)
- Education (0.86)
- Information Technology > Security & Privacy (0.86)
- Government > Military > Cyberwarfare (0.54)